Source quality
Characteristic Name: | Source quality |
Dimension: | Reliability and Credibility |
Description: | Data used is from trusted and credible sources |
Granularity: | Information object |
Implementation Type: | Process-based approach |
Characteristic Type: | Usage |
Verification Metric:
The number of tasks failed or under performed due to lack of source quality |
The number of complaints received due to lack of source quality |
The implementation guidelines are guidelines to follow in regard to the characteristic. The scenarios are examples of the implementation
Guidelines: | Scenario: |
---|---|
Asses the reputation of data sources | (1) Central Bank is the best source to get daily exchange rates |
Evaluate the remedies for non-compliance of data | (1) Any remedies given by the source organisation to mitigate the losses in case if the information is of low quality |
Rely on shared information sources created\recommended\used by the organisations operating in the industry | (1) In performing portfolios analysis most organisations use the risk factors produced by a central body of the economy (Central bank) |
Validation Metric:
How mature is the process to maintain quality of data sources |
These are examples of how the characteristic might occur in a database.
Example: | Source: |
---|---|
Consider an inventory database that contains part numbers, warehouse locations, quantity on hand, and other information. However, it does not contain source information (where the parts came from). If a part is supplied by multiple suppliers, once the parts are received and put on the shelf there is no indication of which supplier the parts came from. The information in the database is always accurate and current. For normal inventory transactions and deci- sion making, the database is certainly of high quality. If a supplier reports that one of their shipments contained defective parts, this database is of no help in identifying whether they have any of those parts or not. The database is of poor quality because it does not contain a relevant element of information. Without that information, the database is poor data quality for the intended use. | J. E. Olson, “Data Quality: The Accuracy Dimension”, Morgan Kaufmann Publishers, 9 January 2003. |
The Definitions are examples of the characteristic that appear in the sources provided.
Definition: | Source: |
---|---|
The source of information (1) guarantees the quality of information it provides with remedies for non-compliance; (2) documents its certification in its Information Quality Management capabilities to capture, maintain, and deliver Quality Information; (3) provides objective and verifiable measures of the Quality of Information it provides in agreed-upon Quality Characteristics; and (4) guarantees that the Information has been protected from unauthorized access or modification. | ENGLISH, L. P. 2009. Information quality applied: Best practices for improving business information, processes and systems, Wiley Publishing. |
The notion of abstracting information into a data domain implies that there are enough users of the same set of data that it makes sense to manage their own versions. The dimension of enterprise agreement of usage measures the degree to which different organizations conform to the usage of the enterprise data domain of record instead of relying on their own data set. | LOSHIN, D. 2001. Enterprise knowledge management: The data quality approach, Morgan Kaufmann Pub. |
Reputation is the extent to which data are trusted or highly regarded in terms of their source or content. | SCANNAPIECO, M. & CATARCI, T. 2002. Data quality under a computer science perspective. Archivi & Computer, 2, 1-15. |
The degree of reputation of an information object in a given community or culture. | STVILIA, B., GASSER, L., TWIDALE, M. B. & SMITH, L. C. 2007. A framework for information quality assessment. Journal of the American Society for Information Science and Technology, 58, 1720-1733. |
Data are trusted or highly regarded in terms of their source and content. | WANG, R. Y. & STRONG, D. M. 1996. Beyond accuracy: What data quality means to data consumers. Journal of management information systems, 5-33. |
Continuity of data access
Characteristic Name: | Continuity of data access |
Dimension: | Availability and Accessability |
Description: | The technology infrastructure should not prohibit the speed and continuity of access to the data for the users |
Granularity: | Information object |
Implementation Type: | Process-bases approacd |
Characteristic Type: | Usage |
Verification Metric:
The number of tasks failed or under performed due to the lack of continuity in data access |
The number of complaints received due to lack of continuity in data access |
The implementation guidelines are guidelines to follow in regard to the characteristic. The scenarios are examples of the implementation
Guidelines: | Scenario: |
---|---|
Convenient and efficient platform should be made available to access data depending on the task at hand | (1) For a sales person, a web based interface run on a smart device is more suitable to quickly access data |
Speed of the data retrieval should be acceptable for users working pace | (1) For an online customer care executive, speedy retrieval of information is necessary since the customer cannot be kept waiting (2) With the growth of the database reports become slower (Anti example) |
Continuous and unobstructed connectivity should be ensured for data retrievals | (1) Connection lost while accessing reports (Anti example) |
Proper concurrency control has been implemented | (1) Controlling access to data by locks |
Technological changes in the infrastructure/system should be handled in such a way that they should not make data inaccessible | (1) New version of the software does not provide access to " X out orders" since the new version does not allow the function "X out" |
Validation Metric:
How mature is the process of maintaining an infrastructure for data access |
These are examples of how the characteristic might occur in a database.
Example: | Source: |
---|---|
1) For example, recording the age and race in medical records may be appropriate.
However, it may be illegal to collect this information in human resources departments. 2) For example, the best and easiest method to obtain demographic information may be to obtain it from an existing system. Another method may be to assign data collection by the expertise of each team member. For example, the admission staff collects demographic data, the nursing staff collects symptoms, and the HIM staff assigns codes. Team members should be assigned accordingly. |
B. Cassidy, et al., “Practice Brief: Data Quality Management Model” in Journal of AHIMA, 1998, 69(6). |
The Definitions are examples of the characteristic that appear in the sources provided.
Definition: | Source: |
---|---|
1) Is there a continuous and unobstructed way to get to the information?
2) Can the infrastructure match the user’s working pace? |
EPPLER, M. J. 2006. Managing information quality: increasing the value of information in knowledge-intensive products and processes, Springer. |
Data is easy and quick to retrieve. | PRICE, R. J. & SHANKS, G. Empirical refinement of a semiotic information quality framework. System Sciences, 2005. HICSS'05. Proceedings of the 38th Annual Hawaii International Conference on, 2005. IEEE, 216a-216a. |
1) availability of a data source or a system.
2) Accessibility expresses how much data are available or quickly retrievable. 3) The frequency of failures of a system, its fault tolerance. |
SCANNAPIECO, M. & CATARCI, T. 2002. Data quality under a computer science perspective. Archivi & Computer, 2, 1-15. |